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Abstract. We give improved lower bounds for the number of solutions of some S- 
unit equations over the integers, by counting the solutions of some associated linear 
equations as the coefficients in those equations vary over sparse sets. This method 
is quite conceptually straightforward, although its successful implementation involves, 
amongst other things, a slightly subtle use of a large sieve inequality. We also present 
two other results about solving linear equations on average over their coefficients. 



1. Introduction 

In this paper we will give bounds for the number of solutions of some S"-unit equations 
over the integers. That is, we will investigate the integer solutions of equations such as 

a + 1 = c, 

where all prime factors of the variables are required to lie in some given finite set S. 
(One can equally well speak of 5*-unit equations in any algebraic number field, with S 
being a set of primes of the field, but we shall not consider the more general situation.) 
Throughout we will use the letter s to denote the cardinality of the relevant set S. 

S'-unit equations have been studied fairly intensively for at least thirty years. For 
example, in 1984 Evertse [TU] used Diophantine approximation methods to give upper 
bounds, depending only on s (and, in the number field case, also the degree of the field), 
for the number of S"-unit solutions to equations like a + 1 = c. In 1988, Erdos, Stewart 
and Tijdeman [9] used methods from combinatorial number theory, in particular some 
results they proved about the largest prime factor of a product of sums, to show the 
existence of sets S for which a + b = c has many S"-unit solutions with a, b, c coprime 
integers. In 2003, Evertse, Moree, Stewart and Tijdeman [11] gave an extension of 
Erdos, Stewart and Tijdeman's result to weighted S'-unit equations in n variablea§ (as 
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well as some other results). Many more upper bound results are now known, and there 
has also been much work in other directions. 

In their 2007 paper [T7j, Konyagin and Soundararajan used a nice and simple combi- 
natorial argument to improve the lower bound result of Erdos, Stewart and Tijdeman [9], 
and a more involved argument, requiring lower bounds for the quantity of very smooth 
number^ in certain arithmetic progressions, to show the existence of sets S for which 
a + 1 = c has many S-unit solutions. Finally we mention two recent papers [18j [19] 
of Lagarias and Soundararajan, in which the Generalised Riemann Hypothesis is used 
(powering a version of the circle method) to give lower bounds for the number of co- 
prime S-unit solutions to a + b = c, in the special case where S is fixed as the set of the 
first s prime numbers. 

In this paper we will give lower bounds for the number of solutions of some S-unit 
equations, using general results about character sums and Kloosterman sums. 

Before stating our results, we explain a general strategy for exhibiting many solutions 
to an S'-unit equation. The work of Erdos, Stewart and Tijdeman [9J, of Evertse, Moree, 
Stewart and Tijdeman [11] . and of Konyagin and Soundararajan [T7] can be seen to fit 
into this strategy, and our approach is also based upon it. 

It seems a natural guess that, for given s, an S'-unit equation such as a + b = c will 
have most coprime solutions (at least roughly) if S consists of the first s primes. For 
example, one might imagine that the fine distribution of S-units looks fairly random, 
subject to global constraints on their density, and so one is most likely to find solutions 
when the S-units are as "concentrated" as possible (i.e. when the elements of S are 
small). It is reasoning of this kind that gives rise to many conjectures about the number 
of solutions to S-unit equations. More precisely, it is known that for any fixed a > 1, 

#{n < x : if prime p \ n then p < log a x} = ^"Va+oC 1 ) asx-^ oo. 

See §2 for a proof of a lower bound of this kind. If we choose a = 3/2 + e, and let x be 
large, then§ the right hand side is a; 1 / 3 + n ( <: )+ ( 1 ) ) an d we might imagine that 

#{(a, b, c) 6 Z 3 : gcd(a, b, c) = 1; a + b = c; if prime p \ abc then p < log 3//2+e x} 

^ x l/3+fi(e)+o(l) . a ,l/3+n(e)+o(l) . x l/3+Q(e)+o(l) j % = x <n(e)+o(l) ^ 

which is > e ( lo g 3/2+E a; ) 2/3 \ The conjectured lower bound for the maximum number of 
coprime S-unit solutions to a + b = c is, accordingly, e s . See e.g. Konyagin and 
Soundararajan's paper [T7] for further calculations of this kind. 



2 A number is said to be y-smooth if all of its prime factors are at most y. By "very smooth" we mean 

"j/-smooth, with y very small relative to other quantities in the argument" . 

3 As is usual, we employ the notation f2(e) to denote any function of e that is e. 
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A standard approach for counting solutions to equations whose variables lie in com- 
plicated sets is the circle method. However, it is well known that this almost never 
works for equations with only two free variables, and for equations with three variables 
one would usually work with sets having density at least \og~ c x, say, for C some pos- 
itive constant. In contrast, we would want to work with sets having density more like 
x~ x l 2 or x~ 2 / 3 , and about which we also don't have much Fourier-analytic information. 
In their papers [EJ[T9], Lagarias and Soundararajan assume the Generalised Riemann 
Hypothesis to obtain suitable information. 

We will avoid these difficulties by giving up on the idea of counting solutions for S 
that is fixed in advance, and instead allow ourselves to vary S a little in response to 
things that happen in our arguments. In other words, we will start out with a set S', 
and will find lots of "almost solutions" to the equation we are interested in whose prime 
factors come from S' . Then we will add a few extra primes into S', forming a set S, 
in such a way that quite a few of the almost solutions are turned into actual solutions 
with prime factors from S. Thus when Erdos, Stewart and Tijdeman [9] were seeking 
many coprime solutions to a + b = c, they took S ; to consist of all primes up to some 
point, and used their results about prime factors of sums to show the existence of a 
fixed (and not too large) b such that a + b = c has many solutions in a, c having all 
their prime factors in S'. Then they set S = S' U {p : p prime, p \ b}. Konyagin and 
Soundararajan [17] studied the S- unit equation a + b = c by studying the solutions of 

c — b = au, 

where (roughly speaking) a, b, c had only small prime factors (from S'), and u was any 
small integer. They showed the existence of so many solutions that some "popular" u 
had to occur in many of them, and then set S = S' U {p : p prime, p \ u}. Similarly, 
they obtained S'-unit solutions of a + 1 = c by studying solutions of 

c — 1 = au. 

We will study the 5-unit equation a + 1 = c by studying the solutions, in smooth 
numbers a, c and integers u, w from certain intervals (roughly speaking), of the equation 

au + 1 = cw. 

Similarly we will study the 5-unit equation a + 6 + l = cby studying the solutions of 
au + b + 1 = cw, with a, b, c smooth numbers and u, w integers of small modulus. 

As a warm-up, in §2 we shall prove the following result: 
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Proposition 1 (Erdos, Stewart and Tijdeman, 1988). There are arbitrarily large sets 
S of prime numbers such that 

#{(a, b, c) G Z 3 : gcd(a, b, c) = 1; a+b = c; if prime p | abc then p G S} > e (1/W W s / lo s s , 
where s := #5. 

Actually Erdos, Stewart and Tijdeman j9] proved this with the constant 1/2a/2 
replaced by 4 — o(l), as s — >■ oo, and Konyagin and Soundararajan [T7j showed a 
stronger lower bound e s2 2 See below for more about this. However, our proof is a 
little different from existing proofs, and motivates the proofs of our subsequent results. 
In §2 we shall also briefly discuss the n- variable S'-unit equation ax + a 2 + ... + a n — 0. 

Our main theorem concerns the two variable S'-unit equation that we have already 
mentioned several times. 

Theorem 1. Let e > be fixed. There are arbitrarily large sets S of prime numbers 
such that 

#{(a, c) G Z 2 : a + 1 = c, and if prime p \ ac then p G S} > e s /6 , 
where s := #<S. 

This will be proved in §3, and improves the previous best result of Konyagin and 
Soundararajan [T7j, who obtained this with the exponent 1/6 — e replaced by 1/16 (or in 
fact a number around 1/15.88). We will also show that if a widely believed conjecture 
about short character sums is true then one could prove this result with an exponent 
1/5 — e, and with an easier proof. The conjectured lower bound here is e sl/2 

In §4, we shall prove: 

Theorem 2. Let e > be fixed. There are arbitrarily large sets S of prime numbers 
such that 

#{(a, 6, c) G Z 3 : a + b + 1 = c; if prime p | abc then p G S; a, b, — c ^ — 1} > e s *° \ 
where Ao ~ 0.53551 is the real root of the cubic 4x 3 — 5x 2 + 9x — 4, and s := #S. 

We specify that our solutions will have a,b,—c ^ —1 to rule out large classes of 
"degenerate" solutions, such as a—a+1 = 1. The conjectured lower bound here is e s , 
whereas e.g. Erdos, Stewart and Tijdeman's argument [9] for the equation a + b = c 
can be directly adapted to give a lower bound e sl/2 \ Konyagin and Soundararajan's 
work [T7] on the equation a + b = c, which the author was unable to improve in that 
case, doesn't seem to extend to handle the equation a + b + 1 = c, because shifting 
by 1 (or any other non-zero quantity) seems to spoil one of their applications of the 
pigeonhole principle. As before, assuming a certain conjecture about short character 
sums allows an improved exponent, that is ~ 0.55496, in Theorem 2. 
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As indicated in our earlier discussion, our proofs involve estimating the number of 
small integer solutions to certain linear equations as the coefficients in an equation vary 
over "arithmetically interesting" sets. This is a topic that has already received quite a 
lot of attention: see e.g. the recent paper of Shparlinski [22J, and references therein, as 
well as the section on linear equations in Shparlinski's survey paper [23J. 

The results in §§3 — 4 are somewhat tailored to the proofs of Theorems 1 and 2, in 
that various parameters are optimised for those applications, but the techniques of the 
proofs should transfer to other contexts and may be of independent interest. In §5 we 
present two general results on solving linear equations on average, which extend some 
results in the literature but don't seem to lead to sharper conclusions about 5-unit 
equations. We also sketch an argument that, in a very limited sense, the arguments of 
§3 are somewhat optimal for obtaining conclusions about the S-unit equation a + 1 = c, 
which rules out a few obvious approaches to extending Theorem 1. 



2. Proof of Proposition 1 
To prove Proposition 1 we shall require two fairly easy lemmas: 

Lemma 1 (Part of Siegel's lemma, 1929). Let n > 2, and let ai,...,a n be integers 
bounded in absolute value by B > 1. Then there exist integers Zi, z n , not all zero and 
bounded in absolute value by (nB) 1 ^ 11 ^ 1 ^, such that 

a x Zx + a 2 z 2 + ... + a n z n = 0. 

Lemma 1 is a consequence of the pigeonhole principle: we follow the proof given in 
Chapter 2.4 of Baker's book fTJ to obtain the clean bound (nf?) 1 ^™ -1 ). For any C > 1, 
as yi,...,y n vary over non-negative integers bounded by C the linear form Y^=i a iUi 
takes at most n[CB] + 1 distinct values, noting that each term grows the set of 
possible values in one direction only, according to the sign of on. Here [■] denotes integer 
part. Thus if 

([C] + l) n > n[CB) + 1 

we must have J2i=i a iVi = YJi=i a iU'i for some (Vi, ■■■■> Vn) ^ (yi,—,y' n ), and then we 
can set (z 1} ...,z n ) = ( Vl - y[, ...,y n - y' n ). Since ([C] + 1)" > [C]([C] + l)"- 1 + 1 and 
n[CB] + 1 < nCB + 1, we see the desired inequality is satisfied when C = \{nB) l ^ n ^]. 

The full version of Siegel's lemma supplies small solutions to systems of linear equa- 
tions: see Baker's book pD], which also has much other interesting material. 

Lemma 2. Let v > 0. If x is sufficiently large, depending on v, then the following is 
true. If 1 < a < 100 (say), and T C [2, log" a;] is any set of prime numbers, and if 
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b > v is such that #T = (log x)/(aloglogx) 7 then 

x b/ 

irs horn T\ > — 

6 log b+1 x 



^.b/a+l/(2a log log x) 



#{n < x : n is squarefree, with all of its prime factors from T} > , (; _ , 



If we set k = [(loga;)/(a log logo:)], it will clearly suffice to prove such a lower bound 
for (# r ). But using Stirling's formula, since x is assumed to be large we have 

(* ) £ (#r '"'wVv! £ { * Tlk) ivr 

Then one calculates that 

(#T/k) k > ((#Taloglog a :)/log a ;)( losa; )/( aloslos:E )(#r/A;)- 1 = x b ' a {k/4T), 

and the inequality in Lemma 2 follows with some room to spare (using the trivial bounds 
k/Vk = Vk > 1 and #T < log 6+1 x). 

Now we can swiftly deduce Proposition 1. By e.g. the prime number theorem, if x 
is a large parameter we can find 3 disjoint sets Ti,T2,% of prime numbers satisfying 

1 2+(21og4)/loglogz: \aP bi+1 T 

%C [2,log 2+ ( 21og4 )/ loslog *x] and #71 > g ^ A , n . = ,1??. * = , 

6 J 41og(log 2+(21og4)/loglog:r a;) log(log 2+(21og4)/loglog:r x) 

where 6j = 1 + (log 4) / log log x. In view of Lemma 2, the subsets A% of [l,x] consist- 
ing of squarefree integers with all their prime factors from % each have size at least 

x l/2+l/(41oglogx)+0(l/(loglogx) 2 )_ 

Using Lemma 1, we see that 

^2 ^2 ^2 ^2 lai2i+a 2 22+Q3Z3=0 > ^ ^ ^ 1 

a±eAl a-2eA-2 a3&A-3 |z 1 |<v / 3~e,...,|z3|<v / 3~e, aieAl CZ2GA2 <X3£A3 

> 2-3/2+3/ (4 log log x)+0(l/(log log x) 2 ) 

In fact this is still true if one only sums over tuples (zi, Z2, z%) with no coordinate equal 
to zero: tuples with two or more zero coordinates make no contribution anyway, and e.g. 
the equation a^zi + ^3-23 = can have at most one solution with 02 6 A,«3 G A3 and 
-22,-23 7^ fixed, since such a 2 and a 3 are always coprime (by construction). Thus the 
total contribution from such "degenerate" tuples is at most 0(x(#^4 1 + #^4 2 + #A3)) = 

o((#A)(#A 2 )(#A 3 )). 

If we swap the order of the summations on the left hand side, this implies that for 
some popular (zi,Z2,z 3 ) (with no zero coordinates) we must have 

3/2+3/(41oglogx)+0(l/(loglogx) 2 ) 
V V V 1 n > - > r 3/(41oglogo ; )+0(l/(loglogx) 2 ) 

.A .A .A aiZl + - +a3Z3= ° ~ (2V^) 3 
aie.4i a 2 £A2 a 3 eA3 v ' 
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Now we shall fix this popular tuple (zi, z 2 , z 3 ). The numbers ar^i, ...,a 3 z 3 may some- 
times have a divisor in common, but we know this isn't the case for any a±, a^, so if 
we reduce each tuple (a\Zi, 0:32:3) by dividing out the highest common factor we will 
obtain distinct triples of coprime integers. 

At this point we have found at least e 3^/^\o g io g x)+o(iogx/(\o g io g x) 2 ) triples of coprime 
integers solving the equation A + B + C = 0, and having all their prime factors from 
the set 

{p prime : p < I og 2+(2iog4)/iogio g x ^ y ^ prime . p \ ZlZ2 ^y 
This has size at most 

L11 2+ (2io g 4)/io g io g , x 17.61og 2 x iQ1 /- o ^ 8.9 log 2 x 

, n 2+(2io g 4)/io g io g x . +3 log V 3a: / log 2 < — — +3 log V3x / log 2 < - — 

logflog +( g >' g g x) 2 log log a; log log x 

when x is large, and (setting s equal to this, and comparing with e 3l °s x /( ilo s lo s^)^ our 
claimed lower bound e ( 1 / 2 v / 2)v /s / 1 °s s immediately follows. 

Q.E.D. 

For general n > 3, Evertse, Moree, Stewart and Tijdeman [UJ showed (if we translate 
from their statements about rational variables to statements about integer equations) 
that there exist arbitrarily large sets S for which 

di + a 2 + ... + a n = 

has at least e s * 1} non-degenerate S-unit solutions. In fact they had a bit more 
precise bound. Here non-degenerate means not only that the variables have highest 
common factor one, but also that no proper subsum of the variables vanishes. This is 
implied by the highest common factor condition when n = 3, but not for larger n, where 
one wants to rule out large numbers of uninteresting solutions such as a — a + 1 — 1 = 0. 
By the reasoning explained in the introduction, it is conjectured that one should be 
able to have e sl 1/n non-degnerate solutions, so (in a certain sense) our knowledge 
is not too far from the presumed truth when n is large. 

Our proof could be adapted to give a lower bound e sl 1 01 for all n, by consid- 
ering n disjoint sets % C [2, logM/C"- 2 )-* x ] of primes, but it is not obvious how it 
could be made to respect the condition of no vanishing subsums. In view of Evertse, 
Moree, Stewart and Tijdeman's work [UJ, it probably isn't much worth pursuing this. 

3. Proof of Theorem 1 

3.1. Preliminary observations. Let 5 > be a parameter, which we shall take to be 
very small. Also let W, X, Z be numbers (that should be thought of as large) satisfying 
W < min{A, Z} and X 1 / 100 < Z < A 100 , say. Following the general strategy set out 
in the introduction, we shall prove Theorem 1 by showing that, if C C [A 1_<5 , A] and 
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A C [Z 1 ^ 5 , Z] are sets having certain properties, 

X 1+1015 W 2 



^ ^ ^ ^ ^ ^ ^-cw—au=l for some u ^ ^/ 

aeA cec i<w<w 

Note that if cw — au = 1 then 1 < -u = (cu> — l)/a < XW / /Z 1 ~ 5 , so if the above holds 
then for some popular 1 < w < W and u we must have 

j^101<5 

Z~Z Z~Z 1 ^-a«=i > — ^T" > x& - 
aeA cec 

We will show that, in particular, one can choose A, C to consist of integers having all 
their prime factors from a set S' of size about log 6+€ X, where e > is the quantity 
from the statement of Theorem 1. We will then set 

S := S' U {p : p prime, p \ uw}, 

so that #<S < #<S' + (log W)/ log 2 + {\og(XW / Z 1 ' 5 )) / log 2 < log 6+2£ X if X is large 
enough, and 

C) G Z 2 : A + 1 = C, and if prime p \ AC then p e S} 
> c) G Z 2 : aw + 1 = cw, and if prime p \ ac then p G <S'} 

as claimed in Theorem 1. Note that we can make the set S arbitrarily large by choosing 
W, X, Z to be correspondingly large. 

To obtain the desired lower bound on the triple sum we shall use a discrete form of 
the circle method, i.e. we shall decompose the indicator function l cw -au=i for some u as a 
sum of "harmonicfl'. It turns out that decomposing into Dirichlet characters is more 
effective than decomposing into additive characters: thus we have 



ZZ/ZYI 1 «»-a«=l for some u = ^) Yl X(c)xHx(l) 

aeA ct=C l<w<W X mod a 

TTT#{ C eC:(c,a) = l}#{l<w<W:(w,a) 
^ 0(a) 



aeA cec i<w<w aeA cec i<w<w T x mod a 



aeA 



aeA X mod a, 

X^XO 



£*( C ) 



cec 



Kw<W 



where Xo denotes the principal Dirichlet character to modulus a. We will show that, 
under suitable conditions, the "big Oh" term is negligible compared with the first term, 
and also the first term is at least (twice) as big as the claimed lower bound X 1+im5 W 2 / Z . 



In the language of the very nice book of Iwaniec and Kowalski [TS] . 
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Before we launch into estimating the "big Oh" term, we observe that (as is hopefully 
suggested by our array of notation) it is likely to be beneficial to take A and C on rather 
different "scales", i.e. to choose Z and X to have different sizes. The author found this 
quite counterintuitive at first, since when solving the equation A + 1 = C one clearly 
must take A and C to be about the same size. However, given the presence of the 
extra variables u, w it turns out that some asymmetry can be very helpful: indeed, if 
we could show (as we might at best hope to) that |^ c ecX( c )| (t^C) 1 / 2 ^ for most 
non-principal characters Xi 101 an y V > (with the implicit constant depending on rj), 
we would clearly have saved most if C was very large compared with a. Also in some of 
our arguments it will be very helpful if we can "factor" the set C as a product, to give 
ourselves flexibility in arranging the sums we must bound. This will be explained more 
precisely in the following sections. 

Versions of the circle method involving Dirichlet characters are no doubt employed 
in many papers, but the author would particularly like to reference the work of Garaev 
and coauthors: the reader might consult e.g. a paper of Cilleruelo and Garaev [6] on 
the least number with totient in an arithmetic progression, and a recent preprint of 
Garaev [13], amongst much other work. These papers also emphasise the importance 
of working with variables on appropriate scales. 



3.2. A conditional proof. First we shall give a fairly straightforward proof of Theorem 
1, but conditional on the far from proven conjecture that if rj > 0, \ is any non-principal 
character mod a, and W is arbitrary, then 



Y *H 



Kw<W 



<^ r , VWaP. 



Actually, assuming this conjecture to hold for all a we shall prove the theorem with the 
stronger lower bound e sl/s 

Assume that we have sets Q C [Q 1_<5 ,Q] and 1Z C [i? 1_<5 ,i?], where QR = X, such 
that all of the products qr, q e Q, r eH are distinct, and are exactly the elements of 
the set C C [X l ~ s , X]. Then for any a G A, 



4>(a 



- y 



X mod a, 

x^xo 



c£C 



Kw<W 



4>(a 



- y 



X mod a, 

x^xo 



-en 



Kw<W 



< 



0(a) 



E 

X mod a, 
X^XQ 



<?e<2 



WaP, 



\0(a) Z-j 

\ X mod a 



53 x(?) 

qeQ 



■— y 



X mod a 



Y *( r ) 



reiz 
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using the conjecture and the Cauchy-Schwarz inequality. Now if Q and R are smaller 
than Z 1 ^ 6 , and therefore (recall §3.1) smaller than a, then the elements of Q will all be 
distinct modulo a, similarly for the elements of TZ. Therefore if we expand the squares 
in the last line, and just perform the summations over x, we will obtain a bound 

<„ VWa?y/#Q ■ #K = VW^^/^C < Z^VW^/#C, 

which is the best we could have hoped to establish. 

Before we go on, we shall introduce some useful non-standard notation. We will write 
B(5) to denote any quantity such that, for any a > 0, we have X~ a < B(5) < X a when 
X is large enough and 5 is small enough in terms of a. From place to place B(5) will 
denote different such quantities. 

With this notation, it remains to determine how we can set the relative sizes of W, X 
and Z, and construct the sets A and C, so that both 

2 J(a)* {c E C : ( °' a) = " W ~ W : (W ' a) = 1} " B ( 5 )* A ^WV¥C, 

(which is at least 2 Eae^(V0(«)) E x mo d a, x ^ xo I E ce c x{c)\\ J2i< w <w x(w)\, by the fore- 
going calculations), and also 

zZlT~^ c e C : (c,a) = 1}#{1 < w < W : (w,a) = 1} > B ^ W \ 

We shall choose C in such a way that all of its elements are coprime to all elements 
of A. Moreover we shall certainly end up having W > Z 1 ^ 1000 , say, so by the sieve of 
Eratosthenes-Legendre (as in e.g. chapter 3.1 of Montgomery and Vaughan [21]) we 
will have 

#{1 < w < W : (w, a) = 1} = W JJ(1 - l/p) + 0(d(a)) > (1/2)W JJ(1 - l/p) 

p\a p\a 

(if Z is large enough, and d(-) denotes the divisor function). Thus it will suffice to have 

VWt/#C >B(5)Z and #-4#C > B(5)XW. 

Finally if a > is any number such that #A > Z x ~ a and #C > X x ~ a both hold, then 
it will suffice to have 

W X/2 x (l-a)/2 > Z l-a x -a > B ^ W _ 

Comparing the two inequalities that we wish to satisfy, we must have 
Z < B{5)X {1 - a),2 (Z 1 - a X- a ) 1/2 => Z {1+a)/2 < B(5)X 1/2 - a => Z < B{5)X {1 - 2a)/{1+a) . 
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However, earlier in our argument we supposed that X = QR < (Z 1 ^ 6 ) 2 = B(5)Z 2 , so 
the inequality can only possibly be satisfied (for large X and small 5) if 

(l-2a)/(l + a) > 1/2, 

i.e. if a < 1/5. On the other hand, if we have a = 1/5 - e; and Z = X^j(i-2«)/(i+«) ; 
and W = x~ e X^ 4a+a2 y (1+a ^; and Q = R = X 1 / 2 ; and if 5 is sufficiently small in terms 
of e, and X is sufficiently large; then we will have 

w l/2 x {l-a)/2 = x -e/2 x (l-2a)/(l+a) > x -e B ^ x (l-2a)/(l+a) = B(5)Z, 

and similarly Z l ~ a X~ a > B{5)W and Q,R< Z 1 ' 5 , as we wanted. 
Finally we can let 7[, %, %, be three disjoint sets of primes satisfying 

T,' Slog : '-'.V):2.1og : '-'.V and #7" > ' 



101og(log 5+e X)' 

and let Q C [1,Q], U C an( i ^ ^ [!> ^] be the subsets of squarefree numbers 

having all their prime factors from 71,%,% respectively. By Lemma 2, these sets 
have size at least Qi-i/5+n( e )+o(i) > i ji-i/5+n( e )+o(i) > z i-i/5+n( e )+o(i) > and one can check 

in the proof of Lemma 2 that this will still hold if we remove numbers smaller than 
Q 1 ^ 5 , R 1 ^ 5 , Z 1 ^ 5 (respectively) from the sets. Now we can just define 

C = {qr : q e Q,r e 11}, 

where clearly all the products qr are distinct. This completes our construction of sets 
A, C producing many solutions of cw — au = 1, and all of whose elements have their 
prime factors from a set of size < log 5+e X. 

3.3. An unconditional proof. Now we shall prove Theorem 1 without the aid of any 

conjecture. To do this we shall require the following two results: 

Number Theory Result 1 (Polya-Vinogradov inequality, Polya, Vinogradov, 1918). 
Let x be a non-principal character to modulus q, having conductor r. For any integers 
M and N with N > 0, 

M+N 
n=M+l 

where d(-) denotes the divisor function. 



< d{q/r)\/r\ogr, 



Number Theory Result 2 (Multiplicative Large Sieve inequality, Bombieri, 1965). 

Let Y < Z be integers, and let Q be any finite set of positive integers. Then for any 
complex numbers a n , 

2 

E^TT E l r W! 2 E *( n K <7 J D(Q)max{Z-F,M(Q) 2 } £ d(n)\a n \ 2 , 

qeQ.®^-' xmodq Y<n<Z Y<n<Z 
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where d(-) denotes the divisor function, D(Q) := max 9g g d(q), M(Q) : = maxgggg, and 
t(x) denotes the Gauss sum of x- 



Probably both of these results are fairly familiar to the reader. Most books seem to 
state the Polya-Vinogradov inequality without reference to the conductor of x, claim- 
ing a weaker bound <C ^\ogq for the character sum. However, most (perhaps all?) 
proofs of the Polya-Vinogradov inequality really prove the stronger statement: see e.g. 
chapter 9.4 of Montgomery and Vaughan [21]. Number Theory Result 2 is proved in 
Bombieri's paper [2], but here too most books seem to state a restricted version where 
the summations are over primitive characters x only. Perhaps surprisingly, it will prove 
quite important to have access to Number Theory Results 1 and 2 in the less usual 
forms that we have stated. 

We proceed again to bound the "big Oh" term from the end of §3.1. We still assume 
that we can factor the set C as a product of Q and 71, as in §3.2, so that 



5^ (f)(0) ^ 

aeA X mod a, 

X^XO 



cec 



Y *K 



Kw<W 



5^ 0(a) 

a£A X mod a, 

X#XO 



qeQ 



Y^ 1 



Y *M 



Kw<W 



< 




V 



^ 0(a) ^ 

(z:J\ X mod a, 

X^XQ 



Y x ^ 



mil 



Y xH 

Kw<W 



If we assume, as before, that Q C [Q 1 ^ 5 , Q] for some Q < Z 1 ^ 5 , and we also apply 
Number Theory Result 1 together with a crude bound <$; r? Z v for the divisor function, 
we find the right hand side is 



\ 



E^y E ™m 

a£A X mod a, 

X^XO 



Y^ 1 



Here we wrote cond(x) to denote the conductor of the character x- However, if the 
modulus a is squarefree, which we will arrange to be the case for all a G A, then we 
just have cond(x) = |t(x)| 2 (see e.g. chapter 9.2 of Montgomery and Vaughan [2~T]). 
This means that we can use Number Theory Result 2 to estimate the above, obtaining 
a bound 
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Exactly similarly to §3.2, we now need to arrange that 

J]-^#{c eC:(c,a) = 1}#{1 < w < W : (w,a) = 1} > B(5) y/#A#C maxjv^R, Z} 
aeA 

and that 

E^TT^ C e C : (c,a) = 1}#{1 < «; < W : («;, a) = 1} > B{b ^ W \ 

oTa ^ Z 

Working as in §3.2, and letting a > be such that #A > Z^ a and #C > X l ~ a , as 
there, it will suffice if 

x (i- a )/2 W > B{5)Z {1+a)/2 maxjv^R, and z x - a X' a > B(8)W. 
Comparing the inequalities, if they are to be satisfied we must have 

z (i+a)/2 max {^ Z } < B(5)X^- a ^ 2 (Z 1 - a X- a ) =}► Z^ 1 )/ 2 maxjv/i?,^} < ^(^X^ 30 )/ 2 ; 

and if we impose the constraint that R < Z 2 this forces that Z < 5(5)X^ 1 ~ 3a )^ 1+3a ). 
On the other hand, we supposed earlier that Q < Z 1 " 5 , and so X = QR < B(5)Z 3 . 
This means that, under our constraint R < Z 2 , the inequalities can only possibly be 
satisfied if 

(1 -3a)/(l + 3a) > 1/3, 

i.e. if a < 1/6. The reader may check that if we tried to set R > Z 2 , then we would 
end up requiring that 

z (3a-i)/2^ < B(5)(QRY 1 - 3a)/2 < B(5)Z^-^ 2 R^- 3a ^ 2 ^R< B{5)Z 2 ^- 3a ^ 3a , 

which still cannot be satisfied for any value of a larger than 1/6. 

Finally we observe that setting a = 1/6 - e; and Z = x^X^ 3 ^/^ 3 "); and W = 
x -e X (i-5a)/(i+3a). and q = z i-s ( and R = x jq < ^2). our various inequalities and 

assumptions will all be satisfied. Theorem 1 follows on constructing the sets Q, 1Z, A 
exactly similarly to §3.2, simply replacing 5 + e by 6 + e in all of the exponents. 

Q.E.D. 

4. Proof of Theorem 2 

4.1. More preliminary observations. Our proof of Theorem 2 will follow much the 
same strategy as our proof of Theorem 1, so we shall be brief in recalling this. This time 
we shall have a small parameter 5 > 0, and large quantities W,X,Y,Z satisfying Z < 
min{X, Y } and X 1 / 100 <Y< X 100 , say. We shall prove Theorem 2 by showing that, if 
C C \X l -\X], B C [Y^ 5 ,Y] and A C [Z 1 ^ , Z] are sets having certain properties, 

\^\^\^ \^ viw/ (Y + XW\ 

/ J / J / J / J i-cw-au=b+l for some u^O > X I _ g I . 

aeA beB ceC i<w<w ^ ' 
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Note that we only want to count quadruples (a, b, c, w) such that cw — au = b + 1 
for some non-zero u, so that distinct values of a give rise to distinct values of au. In §3 
we didn't need to specify this, since when cw — au = 1 it was necessarily the case that 
u 7^ 0. We will be able to force that n ^ without any effort, since it will transpire that 
choosing Y = (Iiy) 1 ^ 1- *' = B(5)XW is roughly optimal, (recall the notation B(S) 
from §3.2), and then 

\cw\ < XW = Y 1 " 6 < b + 1, 

so if cw — au = b + 1 then u must be non-zero. We will not fix Y in this way yet, so that 
the reader can see the optimality of the choice in our calculations, but we will assume 
that Y < B(5)XW. This is a simplifying assumption that should not hurt us, since for 
given Y, Z, W and X < B(5)Y/W we would expect to only increase our quadruple sum, 
and not essentially change the quantity on the right hand side, by increasing X. 

In any event, if cw - au = b + 1 then -Y/Z 1 ' 5 < u = (cw - b - I) /a < XW/Z 1 ' 5 , 
so if the above holds then for some popular 1 < w < W and u^Owe must have 

y^ y^ y^ icu>-cm=&+i > x s . 

aeA bet3 cec 

We will show that, in particular, one can choose A,B,C to consist of integers having 
all their prime factors from a set S' of size log( 1 / A °) +e X, where A and e > are the 
quantities from the statement of Theorem 2. We will then set S := S' U {p : p prime, p \ 
uw} : and Theorem 2 will follow exactly as Theorem 1 did in §3.1. 

Once again we shall use Dirichlet characters to implement the circle method, noting 

^ ^ ^ ^ ^ ^ ^ ^ lew— au=b+l for some u 

aeA beB cec i<w<w 



<-cw—au=b+l for some u 



* £ E E E 

aeA b eB, ceC i<w<w 

(a,(>+l) = l — — 

= E TTT#{ & eB:(b + l,a) = !}#{c G C : (c, a) = 1}#{1 < W < W : (w, a) = 1} + 



aG^l X mod a, 



E 

cec 



E 



where Xo denotes the principal Dirichlet character to modulus a. We will show that, 
under suitable conditions, the "big Oh" term is negligible compared with the first term, 
and also the first term is at least (twice) as big as our claimed lower bound. 

As in §3 we will be working with variables on different scales, and indeed X, Y, Z 
will all end up being different to one another. However, since we are already summing 
over a third variable, namely b G B, we will not "factor" any of our sets as we did in 
the proof of Theorem 1. There is one extra observation that we need to make, which is 
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crucial to the proof of Theorem 2. We may assume that for each integer 0, 

#{(c, c) EC 2 :c-c' = n} <„ X" and #{(c, c', c", c'") e C 4 : cc'-c"c"' = n} <„ X" 

for any 77 > 0, similarly for elements from the set B. For if the first bound failed for 
some n^O and some rj > 0, then since c — c' + l = n + l we would already have 3> X* 7 
non-degnerate solutions (c, —c',n + 1) to our target equation. The same is true if the 
second bound failed, just noting that at most O v (X v ^ 2 ) of the quadruples (c, d , c" , d") 
can produce the same products cd, d'd", using a crude bound for the divisor function. 
(And we will be choosing C, B to consist of integers with all their prime factors from small 
sets, so these solutions would indeed be the many ^-unit solutions that we wanted). 



4.2. A conditional proof. Similarly to §3.2, we shall first prove Theorem 2 on the 
conjecture that if rj > 0, x is an y non-principal character mod a, and W is arbitrary, 
then 



E * 

Kw<W 



W , 



<^ VWa v . 



Actually we shall prove the theorem with the stronger lower bound e sXl E by assuming 
this conjecture, where Ai ~ 0.55496 is the root of the cubic x 3 — 2x 2 — x + 1 that lies 
between one half and one. 
For any a E Awe have 



1 

0(a) 



E 

X mod a, 
X^XQ 



beB 



E*(< 



cec 



Kw<W 



" 4>{a) 



< VWa v 



E 

X mod a, 



&eB 



cec 



\ 



X mod a 



E E*+D 



fceB 



\ 0(a) S 

\ X mod a 



cec 



using the conjecture and the Cauchy-Schwarz inequality. Expanding the squares and 
performing the sums over characters, we find this is at most 



WaP 



\ 



*b+Y! E v-* 

beB b'eB,b'^b 



#C + Y, X/ 1 «k-C 

cec dec,c'^c 



Since all of the numbers 6 — 6' are non-zero integers between — Y and E, and we may 
assume (recall §4.1) that no such integer has more than O n {Y v ) representations as b — f, 
the double sum over b, b' is <^ v Y v (l +Y/a) <C Y 1+ri / Z 1 ' 5 , and similarly the sum over 
c,d is <^ v X 1+v / Z 1 ' 6 . Here we used our assumption, from §4.1, that Z < min{X, Y}. 
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Recalling our useful notation B(5) from §3.2, and our assumption from §4.1 that 

Y < B(5)XW, it remains to determine how we can set the relative sizes of W, X, Y and 
Z, and construct the sets A, B, C, so that both 

E ^~^ b eB:(b + l,a) = 1}#{c e C : (c,a) = 1}#{1 < w < W : (w,a) = 1} 

> B(S)y/W#Ay/{#B + Y/Z){#C + X/Z), 

(which is at least 2 E ae ^( 1 /^( a )) E x mod a , x ^ Xo I Etee *(&+!) 1 1 E ce c x( c ) 1 1 Ei< w <w xH I , 
by the foregoing calculations), and also 

zZt^*^ eB:(b + l,a) = 1}#{c G C : (c, o) = 1}#{1 < w < W : (w, a) = 1} 

£(£)xvy 2 
- z ' 

We shall choose C in such a way that all of its elements are coprime to all elements of 
A, and we shall choose B such that #{6 E B : (b + 1, a) = 1} > #£>/2 for each a £ A. 
Thus, exactly similarly to §3.2, it will suffice to have 

y/W#B#C > B(5)Zx/(#B + Y/Z)(#C + X/Z) and #A#B#C > B(5)XW. 

Finally if a > is any number such that #A > Z l ~ a , #B > Y l ~ a and #C > X^ a all 
hold, then it will suffice to have 

VWY^X 1 -* > B(5)Z x /(Y 1 - a + Y/Z)(X 1 - a + X/Z) and z^Y 1 '* > B(5)X a W. 

It will be easier to think about the first constraint if we replace it by the four con- 
straints that come from multiplying out the brackets, namely 

VWY^X 1 -" > B(5)Z x /max{Y 1 ~ a X 1 - a , XY^/Z, X^Y/Z, XY/Z 2 }. 

It is clearly best to choose W as large as possible subject to the other constraint, i.e. 
to take W = B(5)X- a Y 1 - a Z 1 ' a ; this leaves us to satisfy 

y3(i-a)/2 X i-(3a/2) > B (5)Z (1+a)/2 y/max{Y i-^X 1 -", XY 1 -^ Jz, X^Y/Z, XY/Z 2 }. 

Now if we examine the powers of Y on the left and right hand sides, on the left side 
we have a power 3(1 — a)/2, and on the right the largest power of Y is 1/2. Since we 
shall certainly end up having a < 2/3, (and in fact we shall end up with a = Ai — e, 
of course), the power on the left hand side will be larger, so it will be best if we choose 

Y as large as possible. We have been supposing since §4.1 that Y < B(5)XW, so we 
should take Y = B(5)XW = B(5)X 1 - a Y 1 - a Z 1 - a , i.e. Y a = B(8)X 1 - a Z 1 ~ a . Noting 
that X x ~ a Y /Z > XY 1 ~ a /Z when we make this choice (since Y > X), so that one of 
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the four terms in our maximum becomes redundant, we are left to satisfy 

Z 3(l-a) 2 X 2 a -3a 2 +3(l-a) 2 > B(5)Z a( - 1+ °^ max{Z (1_a)2 X 1 " Q , X (a+1)(1 ~ a) Z^ 2a , X Z^ 3 "} 

= B{5) ™*{Z l - a+2a2 X l ~ a , X ^ +l ^ l -^Z l - a+a \xZ {l - a)2 }. 

(Here we raised both sides of our previous inequality to the power 2a, to clear denomi- 
nators in the exponents). 

On shifting all the powers of X and Z to the right hand side, this remaining constraint 
becomes 

max {^-2+5a-a 2 x -2+3a )X -2 + 4a-a 2 Z -2 + 5a-2a 2 )X -2+4a z -2(l-a)^ < ^ 

The powers of Z in the first two terms are non- negative for all 1/2 < a < 1, so it will 
be best to choose Z as small as possible such that the third term is smaller than B(S), 
i.e. to choose Z = B(5)X^ 2a ^/^ a)2 . Making this choice leaves us to satisfy 

max{X (2a ~ 1)(_2+5a_a2)+( ~ 2+3a)(1_a)2 , x^ 2+4a - a2)( - l ~ a)2+( - 2a ~ 1)( -- 2+5a ~ 2a2) } < B(S), 

where we raised both sides of our previous inequality to the power (1 — a) 2 . Since X is 
supposed to be a large parameter, this just amounts to asking for the maximum of the 
two exponents to be negative, i.e. for 

max{-2a + 3a 2 + a 3 , -a + a 2 + 2a 3 - a 4 } < 0, 

where 1/2 < a < 1. Cancelling one multiple of a, we want that a 2 + 3a — 2 < and that 
a 3 — 2a 2 — a + 1 > 0. The reader may check that the second of these inequalities is a 
stronger condition (i.e. forces a to be smaller), and it will be satisfied when a = Ai — e, 
by definition of Ai as the relevant root of that cubic. If a is chosen in this way; and 
Z, Y, W are chosen in terms of X in the way listed above, that is if 

Z = B(5)X^ 2a ~ 1)/(1 - a)2 and Y = B(5)X a/{l - a) and W = B(5)X {2a ^ )/{ - l - a) , 

for suitable quantities B(5); and if X is sufficiently large in terms of e, and S is sufficiently 
small; then the reader may check back that all of our constraints (including that Z < 
min{X, Y}) will be satisfied. 

Finally, and similarly to §3.2, we can let % be the set of all primes strictly smaller 
than (log*- 1 ^ 1 ^ 6 X)/2, and let Ti,% be two disjoint sets of primes satisfying 

Tl c i(io g <^>+< A-)/ 2 , ,o g '^.. + . x] and #7 - > 5i ^ 1 g / ;;;;;,;! A , r 

Then we can let C C [1,X], B C [1,Y] and A C [l,Z] be the subsets of squarefree 
numbers having all their prime factors from 71,72,73 respectively. By Lemma 2 these 
sets have size at least x 1 - Al+n W+°( 1 ), yi-M+n(e)+ (i) ; z i-x 1+ n(e)+o(i) ^ and one can check 

in the proof of Lemma 2 that this will still hold if we remove numbers smaller than 
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X 1 s , Y 1 s , Z 1 5 (respectively) from the sets. Clearly each element of C is coprime to 
each element of A, and for each a £ A we have 

#{6 e B : (b + 1, a) ^ 1} < y #{6 e B : b = -1 mod p} < — < #£/2 

provided X is sufficiently large, using any of several results about the distribution of 
smooth numbers in arithmetic progressions. See e.g. Granville's paper [14] . 

This completes our construction of sets A, B, C producing many solutions of cw—au = 
6+1, and all of whose elements have their prime factors from a set of size < log (1/Al)+e X. 



4.3. An unconditional proof. Now we shall prove Theorem 2 without the aid of 
any conjecture. We shall need the following result, which is essentially Lemma 3 of 
Friedlander and Iwaniec's paper 



Number Theory Result 3 (Friedlander and Iwaniec, 1985). For any q > 2, any 

N > 1, and any r\ > 0, 

4 



- y 



<P{q) 



X mod q, 
X^XQ 



y 



Kn<N 



where xo denotes the principal Dirichlet character to modulus q, and the implicit con- 
stant depends on r\ only. 

Looking to bound the "big Oh" term from the end of §4.1, we see that for each a E A, 



4>(a 



- V 



X mod a, 

x¥=xo 



beB 



£*(< 



w 



Kw<W 



< 



\ X mod a 



beB 




X mod a 



C6C 



1/4 



V 



- y 

Jn\ Z^ 



X mod a, 

x#xo 



y xiw) 



Kw<W 



1/4 



\ X mod a cGC 

using two applications of the Cauchy-Schwarz inequality, Number Theory Result 3, and 
the same argument as in §4.2 to estimate the sums involving B. The sums involving C 
have size at most the number of quadruples (ci, C2, C3, C4) G C A for which a\{c\C2 — C3C4), 
and this is <Cj X V ((#C) 2 + X 2 /Z 1 ~ s ), exactly similarly to the argument in §4.2 (where 
the X V {^C) 2 term counts quadruples with Cic 2 = c 3 c 4 ). Actually, since we will end up 
choosing C such that #C < \/X, and we have been assuming since §4.1 that Z < X, 
the term (#C) 2 is redundant in this bound. 
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Now, exactly similarly to §4.2, we need to choose W, X, Y, Z and the sets A, B, C so 
that both 

zZj(a)#{ b eB:(b + l,a) = 1}#{c e C : (c, a) = 1}#{1 < w < W : (w, a) = 1} 

> B(6)y/W#Ay/ (#B + Y/Z)(X/Vz), 
and also 

G B : (6+ l,a) = 1}#{c G C : (c, a) = 1}#{1 < w < W : ( W , a) = 1} 

ae.4 ^ > 

B(S)XW 2 
~ Z ' 

In particular, if a > is such that #A > Z 1 -*, B > Y x ~ a and #C > X 1 "", as in §4.2, 
then it will suffice if 

VWY 1 -* > B(5)X a - 1/2 Z 3/4 v / max{Y 1 ~ a , Y/Z} and Y l - a Z l - a > B(5)X a W. 

If we choose W as large as possible, subject to the second constraint, then the first 
constraint becomes (after squaring both sides) 

(yl-a z l-a X -a)y2(l-a) > B{5)X 2a ~ 1 Z 3/2 max^ 1-0 , Y/Z}. 

Comparing the powers of Y on either side, since we shall certainly end up with a < 2/3 
the power on the left will be greater than on the right, and so we should choose Y as 
large as we can. We have been supposing since §4.1 that Y < B(5)XW, so if we set 
Y = B(5)XW = B(5)X 1 - a Y 1 - a Z 1 - a , i.e. Y a = B(5)X 1 - a Z 1 ~ a , then our constraint 
becomes (after raising both sides to the power a) 

Z a(l-a) x -a^ x l-a z l-a^3(l-a) > max^X 1 ^ 1 ^) 1 ^, (X^" Z 1 '*) / Z a } . 

If we shift all powers of X and Z to the right, and simplify as much as possible, this 
remaining constraint becomes 

max{X- 2 + 3a+a2 Z - 2+9a / 2 - a2 ,X- 2+4a Z- 2+9a / 2 - 2a2 } < B(5). 

When 1/2 < a < 1 we have — 2 + 9a /2 — a 2 > 0, so if the above is to hold we must have 

X (-2+4a)(-2+9a/2- a 2 ) < £^ Z (2-9a/2+2a 2 )(-2+9a/2-a 2 ) < £^ X (2-9a/2+2 a 2 )(2-3a-a 2 ) _ 

X is a large parameter, this amounts to asking that 

(-2 + 4a)(-2 + 9a/2 - a 2 ) < (2 - 9a/2 + 2a 2 )(2 -3a- a 2 ), 

or equivalently, after simplifying, that —2a + 9a 2 /2 — 5a 3 /2 + 2a A < 0. Since we are 
looking to have 1/2 < a < 1 we can cancel one multiple of a, and are left wanting 
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4a 3 — 5a 2 + 9a — 4 < 0. This will be satisfied when a = A — e, by definition of A as 
the real root of that cubic. 

If we choose a in this way; and put Z = B(5)X^ 2+4a " l ^ 2 ~ 9a ^ 2+2a2 \ and similarly 
choose Y = B(5)X ( -- 2a2+5a ^ 2 - 1 ^/ ( - 2 - 9a / 2+2a ^ and W = B(5)X^ 4a2+7a - 5 / 2 ' l ^ 2 - 9a / 2+2a ^ 
as listed above; then our various inequalities and assumptions will all be satisfied. Theo- 
rem 2 follows on constructing the sets A, B, C exactly similarly to §4.2, simply replacing 
1/Ai + e by 1/Ao + e in all of the exponents. 

Q.E.D. 

5. Further results and discussion 

5.1. "Optimality" of the arguments in §3. The arguments in §3 established the 
existence of many solutions to the 5-unit equation a + 1 = c by counting solutions to 
an associated linear equation on average over its coefficients. To do this they made use 
of three kinds of tools, namely: 

(i) obvious uses of the orthogonality of characters x\ 

(ii) bounds for incomplete character sums Y^n=M+\ x{ n )\ 

(iii) the multiplicative large sieve. 

The ways in which these were combined in the proofs may have seemed a bit arbitrary, 
but in this subsection we will sketch an argument that, roughly speaking, one cannot 
improve upon the results from §3 (or at least the conditional result in §3.2) if one only 
uses these tools. This suggests that if one wishes to make further progress towards 
the conjectural bound e sl/2 E in Theorem 1, one will need either different general tools 
(some possibilities being discussed in the next subsection), or more specific information 
about e.g. character sums over smooth numbers. The reader will note that the only 
information we used about smooth numbers was an estimate for how many there are, 
as in Lemma 2, and the fact that we could factor certain sets of smooth numbers in 
useful ways. 

We suppose we are arguing in the general framework of §3, so (as in §3.2 and §3.3) 
we are seeking the largest value of a for which both 



X^WZ^>B(5)J2-^S E 



A 0(a) 

Cl£A X mod a, 

X^XQ 



5> 



1<w<W 



and 

Z l-a X -a > B{8)W, 

where C C LY 1 "" 5 , X] and A C [Z 1 ' 5 , Z\ are sets satisfying #C > X^ a and #A > Z 1 ' . 
We need to bound the character sums in the first inequality, and let us restrict ourselves 
to doing this by performing some combination of the following "moves" : 
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use the bound E x mo d a |Efec c iX(l) \ < Ei G £ M 2 for an y set ^ contained 
in an interval of length < a, and any complex numbers q; 
apply the Polya- Vinogradov inequality and the large sieve, as in §3.3; 
use the bound |Ei<it<vF | < B(5)y/W, where x is an y non-principal char- 
acter mod a. 



The bound in the third move is stronger than we can presently prove, so assuming that 
we can freely use it will make our optimality result stronger. We will also assume that 
we can factor C as a product of sets Qi, Qk, for any fixed k that we wish, and a priori 
for any sizes of sets Qi that we wish. (Here we mean that the elements of C should be 
the products qiq2---Qk, for G Qi, and that these products should be distinct). 

Let k > 2, and let X C {2, 3, k} be any subset. We think of the set C as having 
been factored into sets Qi, Qk, and of X as denoting the set of indices % for which 
we shall estimate the character sum over Qi using a "large sieve move" . These are the 
character sums that we shall need to pair up with a sum | Ei<u><w x( w )\i so that we can 
use the Polya- Vinogradov inequality and obtain the Gauss sum weights needed in the 
large sieve. (Recall §3.3). The character sums over the other Qj will be estimated using 
an "orthogonality move", with the associated copy of | Ei<iu<w x( w )\ (which will arise 
from applying the Cauchy-Schwarz inequality) being estimated using our conjectural 
bound B(S)VW. Note that we may certainly assume, after possibly relabelling Q x as 
Q 2 , that the large sieve is not used to estimate the character sum over Q\, because if it 
was we would end up with precisely the argument given in §3.3. 

J2cecX(c)\\J2i< w <wX(w)\ is 



NoW Eae.4 V <K°) E x mod a, 

XT^XQ 



^<J>(a) ^ (n 

a&A X mod a, \i = \ 

x#xo 



Y x ^ 

qeQi 



Y x ^ 



< 



X mod a, 



Y— T 

/ 

V 



n 

2<i<k 



Y x(q) 

qeQi 



l<w<W 

I 



n 



Yj A(„\ Yj 



X mod a, 
X^XQ 



Y x ^ 

qeQi 



Y x ^ 



Kw<W 



(l£A X mod a, 

x¥=xo 





\ 1 / 2 ' / 


Y 




q&Qi 


) \ 



d<=A X mod a, 

X#XO 



Y x ^ 

Kw<W 



2 fc -E l6 i2 fc - !+1 



l/2 fc 



by repeated applications of the Cauchy-Schwarz inequality. Given our limited set of 
moves, the only way we can handle the large powers of character sums is to note that 



Y x ^ 




Y #{(Ji, -, J*-0 G = Ji-J*-i = *M0 

'<KQf 
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if Qi Q [Ql~ 5 ,Qi\, say, where Yli=i Qi = X. Thus if i X, so we will be using 
orthogonality, we need to have Qf _1 < Z, and then l/0(a) Y. x mod a I T iq eQ 1 x(q)\ 2% will 
be at most 

= X"(#Q>r\ 

using a crude bound for the divisor function. Similarly if i e X, so we will be using the 
large sieve, we need to have 1 < Z 2 , and then we can have a bound 

2* 



(IG.4 X mod a, 

X^XQ 



<?eS; 



Kw<W 



Applying these bounds and our conjectural estimate for J2i< w <w xi w )i we see that 
our target inequalities become 



X l - a WZ- a > B(S), 
and 



^ i=l 



rt _a > B(5)W. 

Here we have Yli=i(^Qi) = — A 1_a , since Qi were the factors of C, and we also have 
X = Yli =1 Qi ~ Z 1+ ^ 2 <i<fc.^ 2:2l_,+ ^ex 22_ * — ^2-2 1 - fc +E ie i2 1 -' in view f our di scuss i on 

about the permissible sizes of the Qi. If we set 9 = Xliex^ - *, and simplify where 
possible, our target inequalities become 

x (i-a)/2 W i/2+e > B (5)Z lHl+a)e and Z x - a X~ a > B(8)W, 

where X « Z 2 - 2l ~ fc + 2e . 

Now we can find the largest permissible value of ct exactly as in all our previous 
calculations: it turns out that we must have 

1 - 2 1 ~ k + 20 1 (1 - 2 1 - k + 29) 



a < 



5 - 2 2 " fc + 120 - 2 2 ~ k 9 + 40 2 5 (1 - (2/5)2!- fc + (12/5)0 - (l/5)2 2 ~ fe + (4/5)0 2 ) 

In particular, the right hand side is < 1/5 whenever k > 2, so that we cannot use these 
methods to improve on the conditional bound 1/5 — e from §3.2. 

The reader might also note e.g. that for any given k, we must either have 9 = or 
9 > 2~ k , and so we always have 120 - 2 2 ~ fc + 40 2 > 120 = 6 • 20. This means that, if 
we have access to a strong bound |Xa<«Kiy x( w ) | < B(5)y/W for incomplete character 
sums, we will have the best chance of improving on our exponent 1/6 — e (from §3.3) if 
we do not use the large sieve at all, so that = 0. 
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The alert reader will note that we did not really need the bound | Yli< w <w x( w ) | < 
B(5)\/W in the above calculations, but rather a bound B(5)W t for the mean value 

I 1 2t 

mod a, x ^xo I ^i<w<w x( w ) \ i f° r certain natural numbers t. Arranging things 
so that we only needed a mean value estimate cost us one extra application of the 
Cauchy-Schwarz inequality, which explains why we did not recover the value a = 1/5 — e 
when = 0. When t — 2 such a mean value estimate is a known result, (indeed we 
stated it as Number Theory Result 3, in §4.3), but so far as the author is aware there is 
no known proof of such an estimate for any t > 2. Consequently we cannot rigorously 
implement the foregoing calculations for large values of k, except if the large sieve is 
used lots of times; and in that case 9 will be quite large, which we just noted was 
undesirable for obtaining a strong result. An interested reader may care to explore for 
what value of t one would need a sharp mean value bound to improve on our exponent 
1/6 - e. 

5.2. More results on counting solutions to linear equations on average. In this 
subsection we will give two general results on solving linear equations on average over 
their coefficients. The author proved these whilst pursuing the work in §3, but as we 
shall discuss the results they supply about 5-unit equations are not as strong as those 
already presented. We will also briefly discuss some other ideas for obtaining results of 
this kind, and mention some connections with work of other authors. 

Unlike the arguments in §3, the following theorem can supply quite good information 
if one wants the coefficient sets A, C to be on the same scale, i.e. if one wants to take 
X and Z about the same size: 

Theorem 3. Suppose that C C [3X/4, X] and A C [3Z/4, Z\ art any sets of natural 
numbers, and that the set C "factors" as a product of some sets Q C [3Q/A,Q] and 
1Z C [3i?/4, R], in the sense that QR = X , and 

C = {qr : q e Q,r e 11} , 

and all of these products qr are distinct. Also suppose that Z < X , and that each 
element of C is coprime to each element of A. Let l/y/Z < // < 1 be any number, and 
suppose that 1Z satisfies #1Z > l//i. Then for any r\ > 0, 

E E =1 for somen = (1 + 0(1/ log Z))»#A#C + O (# Ay/ #CX (fog Z)/ (fJ,Z)) + 

a£A l<w<fia ceC 

+O v (X r > x / f i#A#C( x /QZ + (XRf\ZR 2 + Z 2 + X/fi) 1 ^)), 

where the constants implicit in the "big Oh" notation are absolute except in the third 
case, where the constant depends on rj. 
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In earlier sections we summed over 1 < w < W, whereas here the range of summation 
varies a little with a. This makes no real difference when applying Theorem 3, since all 
the elements a G A are roughly the same size, but this formulation makes some steps 
of the proof a bit neater. The requirement that each element of C is coprime to each 
element of A could be removed, if the conclusion of the theorem was adjusted suitably. 

The proof of Theorem 3 will occupy most of the rest of this paper. We apply the 
circle method with additive characters, rather than multiplicative characters, noting 
that the triple sum is 

E E E; E e«<r' -«,)/„) = + 

aeA l<w<[na] ceC -a/2<h<a/2 aeA 

+E E <; E «<=r)>E«<^)- 

aeA -a/2<h<a/2, Kw<\ua] CgC 

Here, as is usual, we let e(t) = e 2mt , and used c -1 to denote the multiplicative inverse of c 
with respect to the relevant modulus (in this case a). Now for non-zero —a/2 < h < a/2, 

1 v-^ le(—h[ua]/a) — l 1 e(—h[ua]/a) — 1 . 
- > e(-hw a) = — \ , ,\ — = — r / + 1/a . 



l<-u)< [/*a] 

This is easily seen if one distinguishes the cases where \h\ < a/100, say, and where 
is larger (and so the whole sum is 0(1)). Moreover we have 

1 e(—h\aa]/a) — 1 . . 1 e(—ha) — 1 . . . . . 

— v J j + o(i / a ) = — v — — _|_ 0(1 a) =: sJh) + 0(1 a), 

say, and so for any A < 3/8 (so that XfiZ < (1/2) min ag ^a) our triple sum is 

^A#C + E s^h)J24hc- 1 /a) + 0((\^ + l/Z)#A#C) + 



aeA -x^z<h<\^,z, 



ceC 



+ E E (-„ E e(-W«))E e (^Va). 



aS.4 -a/2<h<a/2, 
\h\>\iiZ 



l<w<[na] ceC 

To deal with the sums over large |/i|, we can fix a E A and H < Z and note that 



E 

cec 



E 4 E «=£)M^: 

fl"<|/i|<2# 1<«><[H 



< 




a-l 

E 

r=0 



E (s E 

tf<|/i|<2.ff 1<U)<[H 



—hw,. .hr, 
e( ))e(— . 



a 



a 
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on slicing the set C into Q(X/a) pieces of length a. Then we can expand the square and 
perform the summation over r, obtaining a bound 



« V¥c 



\ 



a—1 ^ 

X Jp#{ H <\h\<2H:h = r mod a} 2 < ^#CX/H 



r=0 



since we have H < Z <^ a and \(l/a)J2i< w <[[ia] e (~hw/a)\ ^ 1/H. This esti- 
mate clearly implies that the contribution from \h\ > XfiZ to our triple sum is <C 
H^Ay/ i^CX/ (A/iZ), and on choosing A = 1/ logZ this is acceptable^ in Theorem 3. The 
error term 0((A/x + 1/Z)#A#C) is also 0((fi#A#C)/ log Z) with this choice, since we 
assumed that /x > l/\fZ. 

We are left with a neat looking sum of weighted Kloosterman sums, which we shall 
estimate using a deep result of Deshouillers and Iwaniec [7J . The author was inspired to 
do this by the very nice use of Kloosterman sum estimates in a paper by Matomaki [20J 
(concerned with a rather different problem). It also works out a suggestion in Question 
32 of Shparlinski's survey paper [23] . We will state the relevant result now, although it 
will require a little preparation before we can apply it. 

Number Theory Result 4 (Deshouillers and Iwaniec, 1982). Let C, D, N, R be num- 
bers at least as large as 1/2, and b n ^ r be a complex sequence. Let g(c) be an infinitely 
continuously differentiable function, having compact support in [C,2C], and satisfying 

d v 9 

-r- <t> c v 
dc v 

for any v > 0, the implicit constant depending on v at most. Let h(d), having compact 
support in [D, 2D], have the same properties. Then for any e > 0, 



&n ' r g(c)h(d)e(n(rd) 1 /c) 

R<r<2R0<n<N c,d, 

(rd,c) = l 



< e {CDNR) e K{C,D,N,R)J £ £ 

R<r<2R0<n<N 



where (rd) 1 stands for a solution of (rd) x rd = 1 (mod c), and 

K 2 (C, D, N, R) = C(R + N)(C + DR) + C 2 D y /(R + N)R + D 2 NR. 

This is a slight restatement of Theorem 12 of Deshouillers and Iwaniec [7], where 
we have simplified the allowed type of smooth weight function somewhat, and have not 
mentioned an additional variable that one can have in their result 



5 If this estimate was unacceptable for some application, one could probably reduce the contribution 
from large \h\ to a satisfactory level by counting the numbers w with a smooth weight (rather than the 
sharp cutoff to the interval 1 < w < [/xa]). We leave it to the interested reader to work out the details. 
6 If one assumes that Selberg's eigenvalue conjecture is true this leads to an improvement of Number 
Theory Result 4. There seems to be a misprint in the statement of Theorem 12 of Deshouillers and 
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Initially we shall work to bound J2 aeA Yl\h\<i/n,hjto s mW 52c£C e ( hc V a )- Preparing 
to apply Number Theory Result 4, we note firstly that (very similarly to the proof of 
Lemma 1 1 of Matomaki [20] ) 



E E ^(")E e 



he 



-i 



cec 



EE E ! *>E' 



aG.4 <?eS |h|<i, 



\ 



Y 9z{a) Y h Q^ 



2Q<„<4Q 
(«,a)=l 



Y s ^ Y e 



hq~ 1 r 1 



|h|<i, 



(r,a) = l 



Here 1[ 3 z/a,z] < #z < lpz/3,4z/3] and 1[3Q/4,Q] <h Q < 1[ 2 q/3,4Q/3] can be any compactly 
supported functions having the smoothness properties required in Number Theory Re- 
sult 4. Examining the contents of the second squareroot sign, if we expand the square 
and reorganise the terms we find this is 

,q~ 1 r± 1 r 2 1 (h 1 r 2 - h 2 r 1 ) 



Y Y ^C'^A) Y M?) Y 9z{a)e{ 



|hll,|fc 2 l<£. ri - raeW ^<Q<^ 



h 1 ,h 27 ^0 



(a,gr 1 r 2 ) = l 



/ 



E E 

f <r<R? -F<"<^?> 



fell, IM^VfAi. h 2 ¥=o 

I r 1 ,r 2 STC,r 1 r 2 =r, 
\ /iir 2 -h2 r l= n 



Y Y 9z(a)e( 



q V 1 n 



J 



2Z <n< 4Z 
(a,qr) — 1 



/ ll r 2 =h 2 ri 

X hqiq) Y 9z ^- 



h!,h 2 ^0 



If we let 6(n, r) denote the bracketed term on the second line, and note that |s M (/i)| \i 
for all /i, we find the above is 



Y Y 6 ( n > r ) Y h Q^) Y 9z(a)e 



q r n 



) + o ri (^nx^QZ), 



2Z <n< 4Z 
(a,qr) — 1 



where the term X v (for arbitrarily small 77 > 0) arises as a crude bound for the divisor 
function evaluated at hir 2 - 



Iwaniec [7], but the author believes that on Selberg's conjecture one can replace y/(R + N)R by VWR 
in the expression for K 2 (C, D, N, R). See below for a further remark about this. 
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By Number Theory Result 4, with the quantities C, D, N, R set as 2Z/3, 2Q/3, 2R/[i, R 2 /2, 
the remaining sums are 



< e {XZ) e \jz{R 2 + R//j,)(Z + QR 2 ) + Z 2 Q^(R 2 + R/ n)R 2 + Q 2 R 3 //j, /^|6(n,r)| 2 . 

y ™i r 



Moreover we have, using the Cauchy-Schwarz inequality, 



J2\b(n, 



< 



n,r 



E E E 1 

zM< n <2fl_ fi 2 . <r< ^2 Vri,r2e7?,,rir2=r 

n^O 2 - - 



E 



ri,r2G"R.,rir2=r 



S »( h l) S »( h 2) 



h \ r 2~ h 2 r \= n 



( 



«, *v E E 



^2fl< n <2fi ri,r 2 en |hil,IM<i//«, 

n/0 \ ftir 2 -h 2 n=™ / 



E i 



E 



L r'2 (/ll — /l3)=n {h2-hi) 



ri,r2(z1Z -i/m<'»i,'»2.' 1 3>' 1 4< 1 /m. 

h^,h,2 , /13 ,/i47^0 



The new term in the last line arises as a crude bound for the number of divisors of 
r 2 (hi — h 3 ), when hi ^ h 3 . Finally, since QR = X, and by assumption R > #1Z > 
our bound from Number Theory Result 4 simplifies to 



< e (XZ) £ y/ZR?(Z + XR) + Z 2 XR + X 2 R/ny/n*(#n/n) 2 

(where we noted that Z 2 R 2 < Z 2 QR 2 = Z 2 XR). Checking back over our calculations, 
recalling that we need to take a squareroot and multiply by y/#A#Q (and replacing 
(XZ) € by X v ), we have shown that 



E E *<.(") E e 

a£A |h|<i, ceC 



he 



<„ X* y /#A#Qy/iJ,#'JlQZ + VXR^ZR 2 + Z 2 + X//i(/i#7^) 



< IV^#C(#+ {XRf/\ZR 2 + Z 2 + X/^) 1 / 4 ), 



which is acceptable for Theorem 3. In the second line we noted that #C = #Q#TZ. 

It only remains to bound Ei/^^/iogZ s mW E ce c efa^/a), presuming 

that the sum over h is non-empty. However, the reader may check that if we split this 
into sums over dyadic ranges H <\h\< 2H, beginning with H — and repeat the 
above argument on each range, then we always obtain at least as strong a bound as we 
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did for the sum over < \h\ < Indeed, the only thing that changes is that we must 
sum over \n\ < 4RH rather than over \n\ < 2R/[a, and that we have the stronger bound 
<C l/H rather than |s M (/i)| <C fi, and the latter always at least compensates for 
the former. Theorem 3 then follows. 

Q.E.D. 

Unfortunately Theorem 3 is not a completely general result, since it requires one 
to be able to factor the coefficient set C. This may seem a slightly bizarre state of 
affairs, but in fact one encounters related issues in many different settings: for example, 
Bombieri, Friedlander and Iwaniec [3j HI [5] were able to extend the range of summation 
over moduli in the famous Bombieri- Vinogradov theorem, but only in the presence of 
certain "well-factorable" weights that made it possible to apply results about averages 
of Kloosterman sums. There is related work, requiring the same well-factorable weights, 
on several important problems of analytic number theory. 

Before moving on, we mention a few other ways in which one might proceed if study- 
ing linear equations using additive characters. Shparlinski [22] applies a deep estimate 
for bilinear forms with Kloosterman fractions due to Duke, Friedlander, and Iwaniec [8], 
and thereby manages to obtain some information even if the sets A, C are on the same 
scale and do not factor. However, the saving over trivial bounds using that result is 
something like (X + Z) 1 / 48 , which is much less good than in Theorem 3. If one has sets 
that do factor, one could also try applying the Kloosterman sum estimates of Karat- 
suba [H], which have the advantage that they do not require one to average over a G A. 
(Or, from another point of view, the disadvantage that they can't exploit any such 
averaging). In fact there are a great variety of estimates available for different kinds of 
averages of Kloosterman sums, many of which the author has not explored but may be 
of use in this context. 

We remark that using Theorem 3, the author was able to prove Theorem 1 with a 
lower bound e sl/1 °~ e , by setting Z = Q = X 2 / 3 , R = X 1 / 3 , and n = Z~ 1 ' 4 = X" 1 / 6 . 
Slightly remarkably, if one works on these scales then it makes no difference to the 
quality of this result whether one uses Theorem 3 or a version conditional on Selberg's 
eigenvalue conjecture (which would slightly improve the estimate of Number Theory 
Result 4). Using Theorem 3 of Karatsuba [16], the author believes that our optimal 
parameter choices for Theorem 1 are Z = R = Q = X 1 / 2 and /i = Z~ 3 / 10 = X~ 3//20 , 
and this results in a bound no better than e sl/1 ° 

We finish by stating the following result, which can supply useful information when 
the sets A, C are on different scales but need not factor: 
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Theorem 4. Suppose that C C [X/2,X] and A C [Z/2,Z] are any sets of squarefree 
natural numbers, and that W < Z is any natural number. Then for any rj > 0, 

E E z2 lcw - au=1 f° rsomeu = E dT~)^ c G C : ( c,a ^ = - w - w '■ ( w ' a ) = 

aeA l<w<W ceC aeA ^ ' 



+0,((XZ)V(#^)(#C)^max{X/Z,Z}), 
where the constant implicit in the "big Oh" notation depends on rj only. 



The proof is similar to that from §3.3, but with one additional ingredient in bounding 
the error term. Thus we have 



E rh( n \ 



aeA 



X mod a, 

x^xo 



Ex( c ) 



cec 



E * 

Kw<W 



w 



< 



E f k( n \ E 



E 



Kw<W 



^ aeA 



X mod a, 

x¥=xq 



\r(xW 



\ a 



E rh(n\ E 



A 0( a ) 

eA X mod a, 

X^XQ 



cec 



\r(x)\ l 



6.4 X mod a, 

X^XQ 



E 



i<w<iy 



|r(x)P 



■(XZ)Vmax{X, Z 2 }#C, 

using the Cauchy-Schwarz inequality and Number Theory Result 2, and noting that 
r(x) never vanishes if x is a character to squarefree modulus. Seeking a non-trivial 
estimate for the sums under the first squareroot, we note they are 

IEi<™<wXW! 2 



E E ^^(a) Y 



2<U<Z X* mod y, aeA 
X* primitive 



«, * E E E^ 



X mod a, 
X induced by x* 



^<V<Z X* mod y, a£A 
X* primitive 



0(0.) 



E 



Ed|a I El<t)<W/d X* 



X mod a, 
X induced by x* 



E x ^ 

Kw<W 



on employing the simple sieve-type identity 

E *»= E x» = £m<o E **m 

l<u><W,(w,a)=l l<tu<W d|(iu,a) d\a l<v<W/d 

(where x is a character mod a, induced by y*, and denotes the Mobius function). 

If y divides a then any primitive character mod y will induce precisely one character 
mod a, whilst if y does not divide a it clearly cannot induce any. Thus the above is, 
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except for a Z v multiplier, 

1 ^2d\a\^2l<v<W/dX*( v )\ 2 

2^ ^^(aj 7, ^ 



2<y<z x* mod y, aeA 

X* primitive 



/ 



E^E ElEiE *»i 2 + e ! E i e 



\ 



6(a) '■ — ' '■ — ' v 

a£A^ x ' d\a \ 2<y<W, y x * mod y, l<v<W/d 

\ y |a primitive 

/ 



W<y<Z, & x * mod y, \<v<W / d 
y | a x* primitive 



s E^E E «v)**v+ E ; E i E 



ae.4 



d|a \ 2<y<W, 

\ y\ a 



V 

W<y<Z, a x * mod y, \<v<W / d 
y\a x* primitive 



X{v) 



J 



using Number Theory Result 1. Finally if we extend the summations over primitive 
characters to summations over all characters mod y, and use orthogonality, and apply 
a crude bound for the divisor function twice, we see the above is at most 



/ 



A 



E^yE E MWv+ E >t * E^E E E i 



ae.4 ^ v 7 da \ 2< y <w, 



W<y<Z, 



ae.4 



' y\°- 



W<y<Z, 

y\a 



«, ^E 



ae.4 



0(a) 



Since this is <^ v Z V (#A)W/Z , Theorem 4 follows. 



Q.E.D. 



The author did not try to incorporate the refinement in the above proof into the 
arguments of §3, and it is possible that it could lead to an improvement of those results. 
However, one could not directly combine it with the splitting of the set C in those 
arguments, (presumably one would have to use the Cauchy-Schwarz inequality again 
to separate | J2 1<w<w x( w )\/\ T (x)\ from everything else), and the author suspects that 
one would lose more in doing this than one would gain. 

The only obvious alternative approach to obtaining a result like Theorem 4 is to 
apply the additive circle method, in which case one would need to bound 

E E s,(h)J2<hc- l /a) 

aeA IM<i/m, ceC 

along with various other terms. As suggested in Shparlinski's paper (22], one could do 
this using Theorem 1 of Duke, Friedlander and Iwaniec [5], and indeed it is not obvious 
what other bound could be applied if neither of the sets A, C are assumed to factor. 
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However, if one applies that theorem one obtains that 



E E ^E^" 1 /") 



« v- E 



lftl<l/(i, 



a£A c£C 



<„ {XZyp-y/#A#C{VxTZ + min{X, Z}), 
/i 

and this is never better than the error bound in Theorem 4 on the whole range 1 < Z < 
X (and is appreciably poorer in the interesting case where W is small). 
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